Pathological Voice Detection Using Efficient Combination of Heterogeneous Features
نویسندگان
چکیده
Combination of mutually complementary features is necessary to cope with various changes in pattern classification between normal and pathological voices. This paper proposes a method to improve pathological/normal voice classification performance by combining heterogeneous features. Different combinations of auditory-based and higherorder features are investigated. Their performances are measured by Gaussian mixture models (GMMs), linear discriminant analysis (LDA), and a classification and regression tree (CART) method. The proposed classification method by using the CART analysis is shown to be an effective method for pathological voice detection, with a 92.7% classification performance rate. This is a noticeable improvement of 54.32% compared to the MFCC-based GMM algorithm in terms of error reduction. key words: pathological voice detection, heterogeneous feature combination, mel-frequency filter bank energies, higher-order statistics, pattern classification algorithm
منابع مشابه
Voice pathology detection and classification using MPEG-7 audio low-level features
In this paper, a new pathological voice detection and pathology classification method based on MPEG-7 audio lowlevel features is proposed. MPEG-7 features are originally used for multimedia indexing, which includes both video and audio. Indexing is related to event detection, and as pathological voice is a separate event than normal voice, we show that MPEG-7 audio low-level features can do ver...
متن کاملOn combining information from modulation spectra and mel-frequency cepstral coefficients for automatic detection of pathological voices.
This work presents a novel approach for the automatic detection of pathological voices based on fusing the information extracted by means of mel-frequency cepstral coefficients (MFCC) and features derived from the modulation spectra (MS). The system proposed uses a two-stepped classification scheme. First, the MFCC and MS features were used to feed two different and independent classifiers; and...
متن کاملGreen and efficient synthesis of propargylamines via A3 coupling reaction using a copper (II)–thioamide combination
A one pot green three‐component coupling reaction of aldehyde, phenylacetylene, and amine derivatives in the presence of copper (II)–thioamide combination as a novel and efficient heterogeneous catalyst under solvent–free conditions is reported. The catalyst displayed high activity and afforded the corresponding propargylamines in good to high yields. The key to this procedure was the generatio...
متن کاملOn Feature Extraction for Voice Pathology Detection from Speech Signals
Reliable, automatic and objective detector of pathological voice disorders from speech signals is a long sought-for tool, by voice clinicians as well as by general practitioners. Such a detector can also be used for low-cost and noninvasive mass-screening, diagnosis and early detection of voice pathology for professionals using voice as an essential career tool, for humans working in risky envi...
متن کاملVoice Activity Detection based on Optima Multiple Featu
This paper presents a voice activity detection (VAD) scheme that is robust against noise, based on an optimally weighted combination of features. The scheme uses a weighted combination of four conventional VAD features: amplitude level, zero crossing rate, spectral information, and Gaussian mixture model likelihood. This combination in effect selects the optimal method depending on the noise co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IEICE Transactions
دوره 91-D شماره
صفحات -
تاریخ انتشار 2008